Data set and results for "Improved data sets and evaluation methods for the automatic prediction of DNA-binding proteins"
The file "dna_binding_protein_sequences.zip" has the testing and training sets from the paper:
RLL - "random__full_1000.csv"
RSL - "random__50.csv"
RS&LL - "random__50_1000.csv"
RLL where included positive examples have verified DNA binding activity - "random__hq_1000.csv"
The results files are named similarly. The species data sets are derived from "uniprot_data_bac.tab" and "uniprot_data_not_bac.tab". See code. The ESM embeddings used by the XGBoost model are in "dna_binding_protein_esm.zip".
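The file layout described above could be read with a short script like the one below. This is a minimal sketch, not the authors' code: the function name read_dataset_rows and the label "RLL-HQ" are hypothetical, and it assumes the CSV files are UTF-8 text stored somewhere inside the archive. The column layout is not stated in the description, so rows are returned as plain lists of strings.

```python
import csv
import io
import zipfile
from pathlib import PurePosixPath

# Data set labels and file names as listed in the description.
# "RLL-HQ" is a hypothetical label for the RLL set whose positive
# examples have verified DNA binding activity.
DATASET_FILES = {
    "RLL": "random__full_1000.csv",
    "RSL": "random__50.csv",
    "RS&LL": "random__50_1000.csv",
    "RLL-HQ": "random__hq_1000.csv",
}


def read_dataset_rows(zip_path, label):
    """Return the rows of the CSV for `label` as a list of lists of strings."""
    wanted = DATASET_FILES[label]
    with zipfile.ZipFile(zip_path) as archive:
        # Match on the base name so any folder prefix inside the archive
        # is ignored (the internal folder structure is an assumption).
        matches = [m for m in archive.namelist()
                   if PurePosixPath(m).name == wanted]
        if not matches:
            raise FileNotFoundError(f"{wanted} not found in {zip_path}")
        with archive.open(matches[0]) as raw:
            text = io.TextIOWrapper(raw, encoding="utf-8", newline="")
            return list(csv.reader(text))
```

Calling read_dataset_rows("dna_binding_protein_sequences.zip", "RSL") would then return every row of "random__50.csv", including any header row, without interpreting the columns.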
Developing an efficient method for determination of the DNA-binding proteins, due to their vital rol...
Background: Identification of DNA-binding proteins is one of the major challenges in the field of ge...
This file contains Gini importance of each feature type in random forest. (XLSX 86 kb)
Data sets and results for "Improved data sets and evaluation methods for the automatic prediction of...
The knowledge of DNA-binding proteins would help to understand the functions of proteins better in c...
DNA-binding proteins (DNA-BPs) play a pivotal role in various intra- and extra-cellular activities r...
Since the importance of DNA-binding proteins in multiple biomolecular functions has been recognized,...
Abstract: Protein and DNA have vital role in our biological processes. For accurately predicting DNA...
Detecting the sites on genomic DNA at which DNA binding proteins bind is a highly relevant task in b...
DNA-binding proteins are crucial for various cellular processes and hence have become an imp...
Nowadays, various machine learning-based approaches using sequence information alone have been propo...
Summary: The iDBPs server uses the three-dimensional (3D) structure of a query protein to predict wh...
Background: Interactions between DNA and proteins are essential to many biological processes such as...